LP IMPROVING BOUNDS FOR AVERAGES ALONG CURVES 

TERENCE TAO AND JAMES WRIGHT 



Abstract. We establish local [L^,L'^) mapping properties for averages on 
curves. The exponents are sharp except for endpoints. 



1. Introduction 



Let n > 2, and let Mi and M2 be two smooth n — 1-dimensional manifolds , each 
containing a preferred origin Omi and Om2- We shall abuse notation and write 
for both Oa/i and M2 ■ For the purposes of integration we shall place a smooth 
Riemannian metric on Mi and M2, although the exact choice of this metric will 
not be relevant. All our considerations shall be local to the origin 0. 

We are interested in the local improving properties of averaging operators on 
curves. Before we give the rigorous description of these operators, let us first give 
an informal discussion. 

Informally, we assume that we have a smooth assignment X2 7^2 taking points in 
M2 to curves in Mi, with a corresponding dual assignment xi 7*^^ taking points 
in Ml to curves in M2, such that 



We then form the operator R taking functions^ on Mi to functions on M2, defined 



where the amplitude function a{xi,X2) is just a smooth cutoff to a neighborhood 
of (a;i,a;2) — (0,0), and integration on 72,2 is respect to the induced Riemannian 
metric from M2. This operator has an adjoint 
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where the amphtude function a* is another smooth cutoff to {xi,X2) — (0,0). To 
keep things from being vacuous we shall assume that G 70 (or equivalently that 

e 7o*)- 

A basic example of such an operator is the two-dimensional Radon transform, in 
which n = 3, Mi is the plane R^, and is the space of lines in (endowed 
with some reasonable Riemannian metric, and with some line through the origin 
designated as the line), with 7 given by the tautological map 7/ :— I. The dual 
curve 7a; then consists of those lines in M2 which contain x. The Radon transform 
is then the familiar operator 

Rf{l) = jj. 

Another example of such an operator is that of convolution with a fixed curve 
7^ C R""\ for instance % := {{t, ^^ . . . ,t"'"^) : t e R}. We then define Mi := 
M2 := R"~"^ and set 73,2 := X2 — 7o, so the dual curve is 7*^ = x\+ 7q. We can 
then consider operators R of the form 

Rf = f^da 

where da is a fixed smooth compactly supported measure on 7q. 

This class of operators was first introduced in [11], see also [21], [24], [23]. We are 
interested in determining the exponents 1 < p, 9 < cxd such that R maps U'^Mi) 
to i^(M2) locally in a neighborhood of the origin, perhaps shrinking the support 
of the cutoff a if necessary. As will become clear, such a mapping property is easy 
to show if g < p; the interesting case is when q > p. Observe that such a mapping 
property is essentially independent of the metric structure on Mi, M2 or on the 
cutoff a (assuming the support is sufficiently small), and is indeed invariant under 
diffeomorphisms of Mi and M2. Thus we expect the set of exponents {p,q) with 
this mapping property to depend only on diffeomorphism-invariant properties of 
the family of curves X2 t-» 'yx2 ■ 

Having given an informal description of our problem of interest, we now give a 
formal setup (based on that in [11], [21]; a prototype of this "double fibration" 
formulation appeared earlier in [23]) which will be more convenient to work with. 
In addition to the n — 1-dimensional manifolds Mi and M2 given earlier, we shall 
also work with an n-dimensional smooth Riemannian manifold S with another 
preferred origin = Oe. We will also assume we have smooth maps tti : E — !■ Mi 
and 7r2 : S ^ M2 which map to 0, and are submersions (i.e. the derivative has 
maximal rank n — 1) in a neighborhood of 0. 

We then define the transform R by duality as 

/ Rf{x2)g{x2) dx2 = / f{xi)R*g{xi) dxi 

Jm2 J Ml 

f{ni{x))g{7T2{x))a{x) dx 
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where dx is the measure on E induced by the Riemannian metric, and a{x) is a 
smooth cutoff to a neighborhood of 0. (We shall always use x to denote elements 
of E, and xi, X2 to denote elements of Mi, M2 respectively). 

To connect the above formalism with the intuitive discussion given earlier, we as- 
sume that the assignment X2 1— >■ 7x2 and the amplitude function a{xi,X2) is given, 
and then define E to be the n-dimensional manifold 

S := {{xi,X2) e Ml X M2 : e 7xJ = {{xi,X2) € Mi x M2 : X2 € 7xJ 

with Oe := (OmijOms)- We then define tti and ^2 to be just the co-ordinate pro- 
jections TTi(xi,X2) '■= xi, 'K2{xi,X2) '■= X2. Thc rcadcr may easily verify by the 
change-of- variables formula that the transform given by (1) then obeys the formula 
(2), if the support of the cutoffs are sufficiently small. In particular the examples 
of thc Radon transform and convolutions with curves given previously can be put 
into the above framework. 

The class of operators of thc form (2) is actually slightly larger than that given 
by (1), as it allows for the curves 7x2 and 7*^ to develop cusps. In general it may 
happen that the kernels of £>7ri(0) and D-k2{^) agree but this cannot be the case 
in the setting of (1) where E is a submanifold of Mi x M2 and tTj : E — > Mj are 
the restrictions of the co-ordinate projections from Mi x M2. Certain examples of 
fractional integration along curves fall within the more general class of operators 
given by (2). (See the remarks section. Section 10). 

Definition 1.1. Let 1 <pi,P2 < 00. We say that the sextuple (E, Mi, M2,7ri,7r2,0E) 
is of strong-type (pi,P2) if 'we have an estimate of the form 

I / /i(7^i(2;))/2(7r2(a;))a(a;) dx] < ||/i||iPi(Mi)||/2||LP2(M2) 

for all fi , f2 and all cutoff functions a supported in a sufficiently small neighbour- 
hood of 0. (See the Notation section for the definition of <). 

In the intuitive setting, the above definition (modulo irrelevant technicalities when 
P2 = 00) corresponds by duality to R mapping I/P^(Mi) to LP^{M2), where p' is 
the usual dual exponent to p, l/p+ 1/p' = 1. 

We are interested in the problem of determining, for a fixed sextuple (E, Mi, M2, tti, 7r2, Oe), 
the set of exponents Pi,P2 for which the sextuple is of strong- type (pi,^!,)- This 
problem is one aspect of a niuc;h more; general and difficult problem concerning 
smoothing estimates for (possibly singular or maximal) Radon transforms; the field 
is too vast to summarize here, but we refer the reader to the survey papers [48], 
[37], [36], [16], and to the recent papers [8], [47]. 

We will in fact work with a more convenient setting, that of restricted weak-type 
estimates. 

Definition 1.2. Let 1 <pi,p2 < 00. We say that the sextuple (E, Mi, M2, tti, 7r2, Oe) 

is of restricted weak-type {pijpi^) if we have an estimate of the form 

\ j^XEAM^))XEMx)Hx) dx\ < \Ei\'/P^\E2\'/P^ (3) 
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for all measurable subsets Ei C Mi, E2 C M2 and all cutoff functions a supported 
in a sufficiently small neighbourhood of 0. Here we use \E\ to denote the measure 
of E with respect to the Riemannian metric. 

Note that we have the trivial bounds of 0(|i?i|) and 0{\E2\) for the left-hand side 
of (3) just from the hypothesis that tti and 7r2 are local submersions. Thus we 
automatically have restricted weak- type when p'2 < pi- (A variant of this argument 
also gives strong- type in the same region). Henceforth we restrict ourselves to the 

case pi < p'2 ■ 

Clearly strong- type (^1,^2) implies restricted weak- type (^1,^2)- Conversely if 
1 < Pi < p'2 < 00 and one has restricted weak-type in an open neighborhood of 
(pijpJ,)) then the Marcinkiowicz interpolation theorem gives strong- type (^1,^2). 
Thus if one is willing to lose endpoints, it suffices to consider the restricted weak- 
type problem. 

We can reformulate Definition 1.2 in a more geometric manner which is more con- 
venient to work with. 

Proposition 1.3. Let I < Pi < p'2 < 00. Then the sextuple is (S, Mi, M2, tti, 7r2, Os) 
is of restricted weak-type (^1,^2) */ (^''^d, only if one has the estimate 

M<\nim'/P^\n2m'^^^ (4) 

for all sets fl CT, in a sufficiently small neighborhood of 0. 

Proof If (4) holds, then (3) follows by setting Vl := 7rf n7r^\S2) nS, where 
B is a sufficiently small neighborhood of 0. Conversely, if (3) holds, then (4) follows 
by setting E\ = 7ri(0) and E2 = 772(0). ■ 

Thus, the question is to determine to what extent the size of a set f2 in S is con- 
trolled by the size of its two projections 7ri(f7) C Mi and 7r2(0) C M2. Intuitively, 
the answer to this question should somehow depend on how "independent" the pro- 
jections TTi and 772 are; for instance, if Mi = M2 and tti = 772 it is easy to see that 
there are no estimates of the form (4) other than the trivial ones when p'2 < pi- 

To make the notion of "independence" more precise we introduce two vector fields 

Xi and X2 on S. For j = 1,2, we let Xj be a smooth vector field defined on a 
neighborhood of Os such that Xj is never zero, and Xj always lies parallel to the 
fibers of iVj, or equivalently that the push- forward {'Kj)t,Xj of Xj is identically zero. 
(Equivalently, Xj(x) is always a non-zero clement of the one-dimensional kernel 
of the derivative map D'!Tj{x) : ^ T^.i^^-^Mj). The vector field Xj is only 

defined up to multiplication by smooth non-zero scalar functions, but we will not 
be bothered by this freedom and just work with a fixed choice of Xi and X2. From 
the classical Picard existence theorem for ODE we can see that such vector fields 
are guaranteed to exist in a suSiciently small neighborhood of 0. 
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With these vector fields Xi , X2 one can then define the flow maps e*^ : S ^ E in 
a neighborhood of for sufficiently small t and j = 1, 2 by the ODE 

le*^^- {x) = Xj (e*^^ (x)); e°^' (x) = x. 

These maps are smooth and form a group in a neighborhood of 0, i.e. e*^^ e*^^ = 
g(s+t)X3 pg^jj interpret the Xj as first-order differentiation operators by the 
formula 

The set of first-order differential operators with smooth co-efficients is closed under 
Lie brackets. We denote [Xi,X2] by X12, and observe the formula 

^12^ := -^e-*i^ie-*^^^e*i^ie*^^^(x)|t,=t,=o. (5) 
atiat2 

More generally, we define vector fields Xyj for all words w: 

Definition 1.4. We define a word w to be any non-empty finite ordered collec- 
tion of Is and 2s. We define the degree deg(u') € N x N to be the ordered pair 
(deg(w)i, deg(w)2), where deg{w)j is the number of occurrences of j in iv. We 
define the vector fields Xyj fov w &W recursively by Xi := Xi, X2 ■= X2, and 

Xwj ■= [Xw, Xj] for all j = 1,2 and w gW, 

thus for instance 

^12211 = [[[[Xi, X2], X2], Xi], Xi]. 
We give a partial ordering on degrees by writing (01,02) < (61,62) when ai < bi 
and 02 < 62- 

Note that there exists a neighborhood of for which all the vector fields X^^ exist 
and are smooth. At first glance it may appear that these vector fields X^ do not 
cover all possible Lie bracket combinations of Xi and X2, but the Jacobi identity 
allows one to write any other Lie bracket combination as a linear combination of 

the Xw 

We say that Xi and X2 obey the Hormander condition at if there exist words 
Wi, . . . ,Wn G W such that X^^(O), . . . ,X^^{Q) span the tangent space TqS at 0. 
This condition has appeared several times before in work on Radon-like transforms, 
most notably in [8] (see also [47]) where the following result is established. 

Proposition 1.5. If Xi and X2 do not obey the Hormander condition at 0, then 
(S, Ml, M2, TTi, 7r2, 0) is not of restricted weak-type {piip'2) for any 1 < Pi < p'2 ^ 
00. In particular, there are no non-trivial mapping properties near 0, either of 
strong type or of restricted weak-type. 

For completeness we give a quick proof of this result in Section 2, as a consequence 
of the machinery developed in [8]. 

It remains to consider the case when Xi and X2 do obey the Hormander condition, 
and we shall assume this for the rest of the Introduction. 
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Definition 1.6. We define VF" to be the space of all n-tuples {wi,... ,Wn) of 
words. If / = {wi, . . . , Wn) is an n-tuple, we define the degree deg(/) e N x N by 

deg(/) = deg(wi) + . . . + deg(u;„) 

and define the function A/ : S — > R by 

Xi{x) := det{Xw^{x), . . . ,X^„(a;)). 

There exists a neighbourhood of where the functions A/ are all well-defined smooth 
functions. The Hormander condition thus asserts that there exists an n-tuple Iq 
such that A/(,(0) ^ 0. Henceforth we fix this n-tuple Iq. 

To relate these n-tuples / to our problem (4) we introduce two-parameter Carnot- 
Caratheodory balls, in the spirit of [27]. 

Definition 1.7. Let x e S be in a sufficiently small neighbourhood of 0, and 

let < 61,62 <C 1 be sufRciently small numbers. Wc define the two-parameter 
Carnot-Caratheodory baW B{x; 61,62) to be the closure of the set 

{e*'^'^' ...e*>'^>'^''(x) :fc>0;|ti| + ... + |tfe| < 1} 

where we adopt the periodic convention that 6j+2 '■= 6j and Xj^2 '■= Xj. 

Informally, the ball B{x; 61,62) represents (up to constants) the set of all points 
which can be reached from x by flowing for an amount 61 in the Xi direction, and 
62 in the X2 direction. From (5) we heuristically expect that we can also flow by 
^1^2 in the X12 direction and still stay inside (a constant dilate of) this ball; more 
generally, we expect to flow by ^^''s("') in the Xyj direction, where we adopt the 
notation 

Because of this, we heuristically expect B{x; 61,62) to look something like the con- 
vex hull of the points x ± 6'^'^^^^^Xyj. Following these heuristics, we will eventually 
be able to obtain a volume estimate (inspired by a similar formula in [27]) which is 
roughly of the form 

\B{x;6,,62)\^Y.^'"'^^'^\^i^^^\ (6) 
I 

where I ranges over a flnite set depending on Iq and on certain restrictions on 
and 62- 

The balls B{x; 61, 62) form excellent examples of sets O to test (4) with. Indeed, 
one heuristically expects 

\ ror X X \\\ \B{x;6i,62)\ 
\iTj{B{x;6i,62))\ ~ ^ 



We chose this definition for the introduction as it is the most intuitive to visuahze. How- 
ever, in our rigorous argument we shall avoid these balls, and work with an equivalent family of 
balls defined by exponentiating various weighted commutators of the Xi. See [27] for a detailed 
comparison of the two types of balls. 
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for j = 1, 2. Inserting this into (4) with Q := B{x; 61,62), we obtain the inequahty"* 

\B{x; 61, 62)1 > 61^6^^' 
as a special case of (4), where the positive numbers ci, C2 are given by 

Cl :=P2/(Pl +P2 -P1P2), C2 ■.= Pl/{pi+P2-piP2)- (7) 

Comparing this with (6), wc are thus led to the fohowing Conjecture. 

Definition 1.8. We define the Newton polytope P of Xi and X2 at to be the 
closed convex hull of the set 

{a; G R+ X R+ : a; > deg(/), A/(0) ^ for some / G W}. 

Note that P is indeed the Newton polytope of the right-hand side of (6), if one 
thinks of this as a Taylor series in 6\ and 62- 

Conjecture 1.9. Let (S, Mi, M2, tti, 7r2, 0) and Xi, X2 obey the Hormander con- 
dition, and let 1 < pi < p'2 < 00. Then we have strong type (pi,P2) */ '^"'^ only if 
the point {01,02) defined by (7) lies in the closed polytope P. 

The main result of this paper is that this conjecture is true away from endpoint 

cases. 

Theorem 1.10. Let the notation be as in Conjecture 1.9. Then we have strong 
type {pi, p2) when {01,02) lies in the interior of P. Conversely, if {01,02) lies in the 
exterior of P, then one does not even have restricted weak-type {pi,p'2)- 

As one can verify from elementary algebra, the statement that {01,02) lies in P is 
equivalent to (l/_Pi, '^/p'2) lying in the closed convex hull of 

{(0, 0), (1, 1)} u rn'^H'Tn ^ ^ e W^", A,(0) ^ 0}. 

deg(/)i + deg(7)2 - 1 deg(7)i + deg(/)2 - 1 

Similarly, the statement that {01,02) lies in the interior of P is equivalent to 
I /p'2) lying in the interior of the convex hull of (8). 

For curves in the plane (n = 3), Scegcr [47] has obtained Theorem 1.10 previously 
for the operators R in (1) (here S is a submanifold of Mi x M2 and the TTj 's are 
the induced co-ordinate projections from Mi x M2). In this case, Xi and X2 are 
necessarily linearly independent at and so the 3-tuples / G with A/(0) 7^ 
defining (8) reduce to looking at iterated commutators Xw,w G W, which do not 
lie in the subspace spanned by X\ and X2 at 0. See [47] for details. 

We now give some examples of Theorem 1.10. 



^Equivalcntly, one can obtain this bound by applying the original averaging operator R to the 
characteristic function of ■k\{B{x; 5\, &2))- These sets (which tend to look like squashed neighbor- 
hoods of curve arcs) are well-known as test sets for these averaging operators; see e.g. [6], [14], 
[47]. 
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Convolution with curves. We consider operators An mapping functions on R" ^ 
to R"-^ defined by 

Anfix) = J fix- 7(i))«(i) dt 

where a is a smooth cutoff in a small neighborhood of and 7 is a smooth curve 
in R"^^ with 7(0) = and 7'(0) ^ 0. This operator is of the above type with 

Mi:=M2:=R"-\ S := {(x, : a; € R"-\ i € R}; Omi := Om^ := Os := 

and 

'Ki{x,t) := x; 'K2{x,t) := x + 'y{t). 
We can select the vector fields Xi, X2 as 

Xi:=dt; X2:=dt-i{t)-y^. 
A routine calculation shows that 

X12 = -i'{t) ■ V, 

and more generally that 

for any word Wi . . .Wk, where e = — 1 if W1W2 = 12, e = +1 if W1W2 = 21, and e = 
otherwise. 

Let us now specialize to the maximally curved case, when 

7(0),7'(0),... ,7^"-'H0) spanR"-!. 
This for instance is the case when 

7(^):=(^,^^...,^"-l). (9) 

Then we see that the vector fields 

Xi,X2, Xi2, Xi2w3, • • • ; Xl2w3...Wn-i 

span R"-i X R at zero, for arbitrary W3 . . . Wn-i £ {1, 2}. Furthermore, any other 
n-tuples of vector fields Xyj which span have equal or larger degree (with respect 
to the partial ordering > on degrees). A simple computation then shows that the 
convex hull of (8) is the trapezoid with vertices 

. , - 3n + 4 n - 2 , ,2 2n - 4 , ,^ ^ , 
(0,0), ( 2 , ), (-, ), 1, 1 . 

— n n n n — n 

Thus our theorem gives strong- type [L^^ , L^^ ) in the interior of this trapezoid, and 
failure of restricted type (i^S L^^ ) outside this trapezoid (when 1 < pi < p'2 < 00). 
This result was already obtained by Christ [6] (with earlier work in lower dimensions 
by [25], [29], [31], [32], [14], [17]), at least in the polynomial case (9); in fact these 
papers go further and prove various strong-type or restricted-type estimates on the 
boundary. 

Restricted x-ray transforms. We consider operators of the form 
Rnf{x',s) := J fix' + t-f{s),t)a{x',s,t) dt 

where n > 3, cc' € R"~^,7(s) = (s, . . . ,s"~^),s,t € R and a{x',s,t) is a bump 
function. This is the X-ray transform on R"~^ restricted to the line complex 
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of lines whose direction lies in the curve {(7(5),!) : s G R}. This is a model 
example of operators studied for instance in [10], [22], [18], [19], [20], [7], [14], 
[17], [33], and elsewhere. This operator can be placed into the above setting with 
Ml := {{x',t) G R"-^}, M2 := {ix',s} G R""^}, and 

S :={(x',i,s) gR"} 

with being the usual origin and the projection maps tti, 7r2 defined by 

TTi{x' , t, s) := {x' + t-j{s), t); n2{x' , t, s) := {x' , s). 

The associated vector fields Xi, X2 can thus be chosen to be 

A computation then shows that 

X12 = -X21 = 7'(s) ■ V,' 
X121 = -X211 = -7"(s) ■ V,' 
X1211 = -X2111 = 7"'(s) • V,' 

and so forth. Furthermore, all other commutators are zero. Thus, we essentially 
only have one spanning set of vector fields: 

-'^l, -^^2, -'^12, -^121, • • • ,-'^121...1, 

where the last vector field has n — 3 consecutive ones. The total degree of this 
n-tuple can then be computed to be 

,n2-3n + 4 ^. 
( ^ ,--1). 

The convex hull of (8) is thus the triangle with vertices 

and so our Theorem gives mapping properties in the interior of this 

triangle and failure of outside this triangle. This agrees with previous 

results in [13], [14], [17], [33], [7], although the results there include certain endpoints 
which are not obtained by our methods. 

Folds and cusps. When S C Mi x M2 is a submanifold and the tt^ 's are the in- 
duced co-ordinate projections, the underlying geometry is sometimes best expressed 
in terms of the microlocal picture 

C 

T*Mi \ r*M2 \ 

where C = 7V*S' = {{xr,^i;x2,^2) e T*(Mi x M2) \ : (a, -6) ^ 
the twisted conormal bundle of E, is the canonical relation in T*Mi x T*M2 (we 
are using perhaps nonstandard notation here). When n > 4, there must be points 
in C where D7t2 and Diti drop rank. The simplest singularity that can occur is 
a Whitney fold (5*1,0 in the Thom-Boardman description) and the second simplest 
stable singularity for tti or ^2, after folds, is a Whitney cusp; singularity class 
5'i,i,o := Si^fi- 
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For the restricted x-ray transform i?„ above, tti has only singularities that belong to 
the Morin singularity classes 6*1^,0, 1 < A: < n — 3 (see [15] for a discussion of these 
singularity classes and their connections with the operators considered here), while 
both projections tti and n2 for the convolution example An (when 7 is maximally 
curved) have singularities at most of type Si^^o, 1 < k < n—3. We refer to situations 
as one-sided when conditions are imposed on one projection while no assumptions 
are imposed on the other projection. On the other hand, situations for which both 
TTi and TT2 belong to a given singularity class are referred to as two-sided. Thus the 
geometry underlying A4 is a two-sided fold while that of A5 is a two-sided cusp. 
See [22] for the microfocal analysis of the restricted x-ray transform. Also see [13], 
[14], [16], [39] for more details. 

To see examples of how this can be expressed in our setting, consider operators of 
the form 

Rf{x) = j f{^{x,t))a{x,t)dt (10) 

where n > 3,7 : R"^^ x R ^ R"^^ is a smooth map with j{x,t) = x and a{x,t) 
is a bump function. Thus x 1— > 7t(x) := 'y{x,t) is a family of diffeomorphisms for 
small t and a result in [8] associates to {jt} a unique sequence of smooth vector 
fields Zi, Z2, . . . such that 7t(a;) ^ exp(^ PZj){x) to infinite order as f — > 0. This 
operator can be put into the above setting with Mi = M2 = R"~^,S = {{x,t) : 
X e R"~^,t e R}, with being the usual origin and 

7ri(a;,t) = 7t(a;), TT2{x,t) = x. 

(Equivalently, if f^: ^ 0, one could take E = {{"f^{x),x) : x e R""\t e R} so that 
S C Ml X M2 is a submanifold containing the diagonal.) The associated vector 
fields Xi and X2 can be chosen to be 

Xi = W{x,t)-V^ + dt: X2 = dt 

where W{x,t) = ^7*+^ ° 7t(a;)U=o- Observe that X12 = [Xi,X2\ = W'{x,t) ■ 
where ' denotes differentiation with respect to Similarly X122... 2 = {—l)''W^''\x,t) 
Vx where k is the number of 2's. 

In this setting, when n = 4, a result of Phong and Stein [39] states that tt2 (rcsp. tti) 
has at most Whitney fold singularities iff Zi,Z2, and Z3 + ^[Zi, Z2] (resp. Zi,Z2 
and Zz — \ [Zi, Z2]) are linearly independent (A similar result holds for cusps, see 
[14]). A simple computation, using the Bakcr-CampbcU-Hausdorff formula, shows 
that Zi,Z2 and Z3 + ^[Zi,Z2] are linearly independent at x iff W{x,0),W' (x.O) 
and W"{x,0) are linearly independent and this is equivalent to 

A/(a;,0) := det(Xi,X2,Xi2,Xi22)(x,0) ^ 0. 

(More generally, for any n > 4, a theorem of Greenlcaf and Seeger [15] gives the 
equivalence between the linear independence of the {W^''^}Q<k<n-2, the linear in- 
dependence of some combination of the Zj's, and 7i"2 having only (strong) Morin 
singularities, Si^ q, fc < n — 3.) Since / = (1, 2, 12, 122) has degree (3, 4), Theorem 
1.10 implies that when n = 4 and 7r2 has at most Whitney fold singularities, the 
operator R maps L^^ to L^^ whenever (1/pi, l/pj) lies in the interior of the tri- 
angle with vertices (0,0), (1, 1), (1/2, 1/3). This is a particular case of a theorem 
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of Greenleaf and Seeger [13] which requires an additional hypothesis but contains 
estimates on the boundary of the triangle. A similar result holds when #1 has at 
most Whitney fold singularities. 

The restricted x-ray transform provides an example of the sharpness of this one- 
sided fold theorem. Another interesting example is the following one considered by 
Secco [45]. Let 

-ft{x) = x- {t,t^,at^) 

where • denotes the 3-dimensional Heisenberg group multiplication. Then W{x, t) = 
(-1, -2t, -{3a + l/2)t2 + 1/2x2 - xit) and so 

X12 = 2da:, + ((6a + l)t + Xi)da:„ 

X122 = -(6a + l)dxs, ^121 = (1 - 6a)9x3- 

All other iterated commutators are zero. When a = 1/6, Theorem 1.10 shows 
that the corresponding averaging operator (10) does not map Lp^ to L^'^ whenever 
(1/pi, l/p'2) lies outside the triangle with vertices (0,0), (1, 1), (^, \). See [45] for 
these results including endpoint estimates. 

A nondegenerate example R^. As the final example, we consider an operator 
of the form (10) when n = 5 and where 

7(a;, t) = {xi +t,X2+ 1^, X3 + 1^, Xi + t^ + X2t). 

One easily computes that W{x, t) = (—1, —2t, — 3t^, —At^ — X2 + 2t^) and so at the 
origin we have 

Xi = —d^i + dt, X2 = dt, X12 = 2dx2, 
X121 = -69^,3 + 2da:^, X122 = -69^,3 + 45a;4, 

-'''1211 = -'''1212 = -'''1221 = -'''1222 = 249^;^; 

all other commutators being zero. Therefore the only 5-tuples of iterated commu- 
tators that contribute have degrees (5, 5), (7, 4), (6, 5), (5, 6) and (4, 7). The convex 
hull of (8) is then the pentagon with vertices 

(0^0),(±i).,^.l),(l,l),(l.l)^ 

Our Theorem thus gives strong type {L^^ , L^^) in the interior of this pentagon, and 
failure of restricted weak type {L^^ , L^^ ) outside. This pentagon is a larger region 
in (^, ^) space than the corresponding one for the operator considered above. 

In the remarks section (Section 10) we explain the difficulties in extending our argu- 
ments for Theorem 1.10 to the endpoint cases. After some geometric preliminaries 
in Sections 3, 4, we will prove the first part of Theorem 1.10 in Section 5, and the 
second part in Sections 6-9. 

Wc would like to thank Mike Christ, Allan Greenleaf, Andreas Sccgcr, and the 
referee for several useful discussions and comments. The authors also thank Mike 
Greenblatt for pointing out an error in the first version of this manuscript. 
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2. Proof of Proposition 1.5 



We first give a proof of Proposition 1.5. The idea is to use a quantitative version of 
the proof of Frobenius's theorem given by Theorem 8.8 in [8]. In our language, this 
theorem asserts (among other things) that if Xi and X2 do not obey the Hormander 
condition at 0, then for any N > there exists a submanifold S* C S containing 
of positive codimension such that for all < J <gC 1 and all a; G 5, t G R with 
dist(a;, 0) < (5 and \t\ < (5 we have 

dist(e*^^a;, S), dist(e*^^a;, S) < 6^. (11) 

In other words, S is "almost invariant" under the flow of Xi and X2- The proof of 
this claim in [8] uses a quantitative form of the Baker-Campbell- Hausdorff formula; 
it is also possible to proceed using the machinery in this paper but we will not do 
so here. 

Assuming this statement, we test (4) with the following neighbourhood of S: 

n := {a; e S : dist(a;, 0) < 6; dist(a;, S) < 6^}. 
If S has codimension k, then we have 

On the other hand, from (11) and the smoothness of Xi, X2 we see that the set 

{e^^^x ■.x&n-\t\< 5} 

is contained in a set which is essentially the same as Q. (but with different implicit 

constants in the definition) and so also has measure ^ §n-k^Nk_ From Fubini's 
theorem and the fact that tti is a submersion we thus have 

ki(n)| < MI6. 

Similarly we have 

Inserting these facts into (4) we obtain the condition 

^n-k^Nk <; ^jn— jJV/c^— l^l/pi-|-l/p2 . 

since 5 can be arbitrarily small, this forces 

(n - /c + Nk) >{n~-k + Nk- 1)(— + — ). 

Pi P2 

Since N can be arbitrarily large and k > 0, we thus obtain 

l + l<i 

Pi P2 

and the claim follows. 
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3. Notation 

In our argument the scxtuplet (S, Mi, M2, tti, 7r2, 0) shall be fixed. We assume that 
Xi , X2 obey the Hormander condition. By continuity we may find a fixed n-tuple 
Jo and a fixed open neighbourhood V C S of such that the vector fields are 
all smooth and well-defined on V, and A/^ is bounded away from zero on V. Unless 
otherwise specified, the symbol x will always be assumed to denote a point in V. 

In our argument we will need a large parameter N ^ 1, which will eventually 
depend on Iq, pi, P2, and we will also need a small parameter < e <C 1, which will 
eventually depend on A'' and Iq, Pi, P2- For the first part of the paper, in which we 
develop the theory of Carnot-Carathcodory balls, we shall only use the parameter 
e; the iV parameter will be used later when we begin to prove (4). 

We use C to denote various large numbers which depend on Iq, pi, p2, V, and 
the sextuplet (E,Mi,M2,7ri,7r2,0). We use Cjv to denote various large numbers 
which depend on the above parameters as well as N, and C]v,e to denote various 
large numbers which depend on the above parameters as well as N, and e. We will 
always assume implicitly that these constants have been chosen sufficiently large. 

Let A, B be positive quantities. We use A < B or A = 0(B) to denote the estimate 
A < CN,eB, and ^ < B to denote the estimate A < Cj^^^B. We use A ~ B to 
denote the estimate A < B < A. Later on we will introduce a variant notation 
A<B. 

Thus for instance we have 

A/„(a;) ~ 1 for all a; e V. (12) 

We also let K he a. large constant (depending only on Iq, pi, p2, N, e, and the 
sextuplet) to be chosen later; our constants C, Cn, Cjv,e will not be allowed to 
depend on K unless explicitly subscripted. 

4. TWO-PARAMETER CaRNOT-CaRATHEODORY BALLS 

Wc now set up the machinery needed to study the Carnot-Carathcodory balls. Our 
arguments here are very much inspired by the beautiful paper of [27]. However 
the results in [27] are phrased for one-parameter balls and in order to exploit them 
one would have to reproduce many of the arguments for the two-parameter setting. 
This would have added no new insights beyond those already in [27], so we have 
adopted a different approach - based more on Gronwall's inequality than the Baker- 
Campbell-Hausdorff formula - to give the same type of results. 

In this section we fix an xq e V, which we assume to be sufficiently close to 0. 
We also assume S = {61,62) to be an arbitrary pair of numbers which obey the 
smallness condition 

0< 61,62 <C^],^K, 
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and the non-degeneracy condition 

These will be the two radii for our two-parameter Carnot-Caratheodory balls. As 
in the introduction we define 

and 

for any pair (ni,n2). 

We let lew denote the set 

I:={/eW^":dcg(/)<-dcg(/o)}. 

e 

Observe that I is a finite set containing Iq, and that from the smallness and non- 
degcncracy conditions of <5i, ^2, 

(^^)deg(7) < (^^)deg(7o) for all I ^ I. (13) 

Define the vector-valued function A by 

Aix) := iiKSr'^^^'^Mx)),^^. (14) 

We shall see later that |A(a;)| controls the volume of B{x; Si, 62), if x is sufficiently 
close to 0. 

From (12) we have 

|A(x)| > (i^J)'*°s(/o). (15^ 

from (13) and the smallness of 61, 62 wc thus have 

{KSf'^^^'^\Xiix)\<\Aix)\ (16) 
for all / € W^, with the implicit constant here allowed to depend on I. 

Fix an Xq € V and observe from (14) we may find an n-tuple Ixq = {wi, . . . , Wn) 
in I such that 

|(if5)deg(/.o)A,^^(^o)|^|A(xo)|. (17) 
Fix this . Define the map $ = i^om to V by 

71 

^h,...,tn) ■.= eMY.^jK-\K5f^^^^^^X^.)xo (18) 

and consider the "balls" ^{B), where i? is a ball in R" ccintcrcd at the origin with 
fixed radius C, which we will choose later. It turns out that these sets are roughly 
comparable to the balls B{x; 1^2) defined in the Introduction (for a detailed dis- 
cussion, see [27]). In fact in the proof of Theorem 1.10 we shall work exclusively 
with the ^{B) and avoid using the balls B{x; 5i, ^2)- 



Since $ is smooth and 

9t^.$(0) = ii'-i(ii'^)'^''sKOx, 



(19) 
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we see from (17) that 

det(£>$)(0) ~ii'-"|A(a;o)| 
and in particular that $ is locally invcrtible near the origin. (We will make this 
statement more quantitative later on). 

For each w G W, let be the pull-back of the vector field K'^{K6y'=e'^'"^Xyj by 
the map these are defined near by the local invertibility of though we shall 
shortly show that they are in fact defined on all of B (there is a slight abuse of 
notation here since the Y^^s arc not exactly iterated commutators of Yi and Y2 
but in fact a multiple of the corresponding commutator - e.g., [11, ^2] = KY12 by 
the way we have just defined F12). We also consider the standard Euclidean vector 
fields 9i, . . . ,dn on B, as well as the co-ordinate functions h,. . . ,tn- Prom (19) 
we have 

Y^^{0) = d,{0). (20) 

Thus the system of vector fields F^i ^„ is a perturbation of the Euclidean 
system di, . . . , 9„ near the origin. 

We now come to the main result of this section, which gives regularity of the vector 
fields Yyj and controls the geometry of the ball ^{B). 

Proposition 4.1. The vector fields Yyj are well-defined and smooth on all of B, 
and obey the estimates 

||^u)||c"(B) £ Cw,M (21) 

for all w gW and M >0, if K is sufficiently large depending on M. 

For the vector fields Y^^ we have the more precise bounds 

Y^,{t)=di + 0{\t\/K) (22) 
for t £ B and i = 1, . . . ,n. In particular we have 

det{Y^, , . . . , yu,„)(f) ~ 1 for all t e B. (23) 
Finally, we have the volume bounds 

mE)\ K-^\A{xo)\\E\ (24) 

for any subset E of B. 
Proof 

Step 1. Estimate Y^.. 

The first step is to bound the Y^^, and in particular to prove (22) and (23). 
By (18) and the definition of exponentiation we have the vector field identity 

n n 

j=i j=i 

on B. 
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Let I < i < n. Taking the Lie bracket of (25) with Yuj^, we obtain 

n n 

y ] {tj)dj + tj [l^j ,dj] = ^ ] (tj)Y^^ + tj [Ywi ) ] 
i=i j=i 
which we rewrite as 

n n n 

j=l j=l j=l 

If we write 

n 

Y^,=:di + J24{t)dk (26) 



/s=l 



then the previous becomes 



j=i k=i fe=i j=i k=i j=i (27) 

Define the vector valued quantity 

Mt) ■■= (aK*))i<«j<n; 

from (26), (20) we observe that A{0) — 0. We now try to use (27) to obtain good 
bounds on A. 

From (35) below we see that [i^^,!^^] can be written as a linear combination of 
the Yu,^ with co-efficients 0{K'^^), so that the dk co-efficient of this vector field is 
0((1 + \A\)/K). From (27) we thus have 

n 

\Y,t,{dja'i)+4\<\Af + {r\A\+r)/K 

for all k, where r := {J2iLi ^i Y^^ is the radial co-ordinate. We can rewrite this as 

mra>:)\<\A\' + {r\A\+r)/K 

where dr = X^"=i is the radial vector field. From the definition of A we thus 
have 

\dr{rA)\<\Af + {r\A\+r)/K. 
On the other hand, we have A = when r = 0, so we have the integral inequality 

r\A{ru;)\ < \A{su;)f + ^is\A{su;)\ + s) ds 

for all u! on the unit sphere. In particular we have 

^^<l + ^( sup M^ + (,,p M^).) 

r K 0<s<r S 0<s<r S 

for r = 0{1). If K is sufficiently large, wc thus sec from standard continuity 
arguments (recall that A is smooth and vanishes at the origin) that A is in fact 
defined for all r and that '^^^^''"■'^ < 1 for all rcj £ B. In other words we have 

A{t) = 0{r/K) (28) 

for all t&B, and (22) and (23) follows. 
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Step 2. Control A and A/^^. 

The next step is to understand the behaviour of the vector-valued function A. We 
begin by proving some regularity properties of A in the directions Xyj. 

Let I g1, w € W, X G V he arbitrary. We observe the identity 
Xyj det(X.u,i , . . . , Xyj^ ) =div{Xu,) det{Xyj^ , X^^ ) 

n 

+ ^ ^ det (^uji ) • • • ) \-^w 1 -^wj ] ) • • • ) -^wn ) i (29) 
i=i 

this formula is just the Lie derivative of X^^ A ... A X^^ with respect to the vector 
field Xyj and can be deduced (for instance) by writing everything in co-ordinates 
and using the product rule together with the formula 

n 

{[X,Y]{x))i = - Y^{x)d^Xi{x). 

Prom (29) and the observation (using the Jacobi identity) that [X^, Xyj.] is a linear 
combination of those X^ji with deg(u;') = deg(w) + deg(«;j), we see from (16) that 

|(^5)deg(«,)+deg(/)^^^^(^)| < Cyj\K{x)\ (30) 

Prom this and (14) we obtain 

j)deg(«,)^^^(^)| < C7^|A(x)| (31) 
By iterating (29) we thus obtain 

|(i^5)dcg(«.i)+...+dcg(»,)+dcg(/)^^^ _^^^^^^^(^)| < Cy,^k\Hx)\ (32) 

for any fc > 1; note that any term of the form div(X^) is smooth and so will behave 
very nicely with respect to derivatives. 

Prom (31) and (18) we obtain 

dr\K{m)\ < K-^\Km)\ 

for all t € B. Similarly, from (30) and (18) wo have 

|a^(ii'(5)^'=8;(/xo)^^^^($(^))| < K~^\A{^{t))\. 

From these estimates, Gronwall's inequality, and (17) wc thus obtain 

|(i^J)dcg(/.o)A,^^(3;)| ^ |A(a;)| ~ |A(xo)| (33) 
for all x S 4>(-B), if K is sufficiently large. In particular we have 

A/.o(a;)~A/,^(xo) (34) 

for all xG^{B). 
Step 3. Control F^. 

Having controlled Yyj., and having controlled the determinants A/, wc will now be 
able to control all the other Yyj, and in particular Yi and Y2. As a consequence we 
will be able to prove (21). 
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From Cramer's rule we see that for all w € W we may write 

n 

(jr(5)d«s(-)x^(x) = c^^j{x){K5f^^^^^^X^. {x) (35) 

for all X S where the co-efEcients Cw,j are linear combinations of ratios 

(i^<5)d«s(^)A/(x) 



(Jf5)<ieg(/.o)A,,Ja;) 



(36) 



for various / S W^. In particular, from (16) we see that Cw,j = 0{1) (with the 
implicit constant depending on w). 

Pulling back (35) under $ we obtain 

n 

where c^j := Cyjj o ^. 

Fix M > 0, 1 < j,k < n, w e W and t G B. To prove (21), it suffices by (37) to 
show the bounds 

|Vfc^,,(t)|<CM,. (38) 

and 

|Vf4(0l<CM. (39) 



The claims (38), (39) have already been proven for M = 0. Now suppose inductively 
that M > and that (38), (39) have already been proven for all smaller values of 
M. Recall that c^j is a linear combination of ratios (36). By repeated applications 
of the quotient rule, (16), (33), and (32) we thus have that 

(i^<5)deg(-.J + -+deg(««,„)|J^^^^ . ..X^,^C^j{x)\ < CM,n, 

for all w <E W, X G $(-B) and 1 < ii, . . . , < ?^• Pulling this back by $ and 
throwing away some powers of K , we obtain 

for all w G t G B and 1 < zi,... ,zm < n. Expanding out the derivative 
operators using (22) and the induction hypothesis (38), (39) for smaller values 
of M, we obtain 

19,1^^2 . . .di^Cyjj{t)\ < Cm,«;(1 + -^IV^c^jI) 
and (38) follows if K is sufficiently large depending on M. 

Now we show (39). Let D be a constant co-efficient operator of order M. We take 
the Lie bracket of D with (27). From the Euler identity 

n n 
3=1 j=l 
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we obtain 

n n n n n n 

j=l k=l k=l j=l fe=l j=l 

We now estimate the terms on the right-hand side. Prom the inductive hypothesis 
(39), (28) and the Leibnitz rule we have 

\D{aia'])\<CM{l + r\V^A\). 

Wc can write [Y^. , Y,^. ] using the Jacobi identity as a linear combination of Y^ . 
From (37), the Leibnitz rule, the inductive hypotheses (38), (39) for smaller values 
of M, and the claim (38) for M that was just proven, we thus have 

|[7^,t,[y„.,y„j]| <CM(i + r|v^A|). 

Combining these facts together, we obtain 

n n 

I r{drDa1)dk + (M + 1) ^iDaDdk] < Cm{1 + r|V^A|). 
fe=i fe=i 

Taking co-efHcients and then letting i vary, we thus have 

\rdrDA +{M + 1)DA\ < Cm(1 + r\W^A\). 

Multiplying by r^, and letting D vary, this becomes 

\drir^+^'7^A)\ < CM(r^ +r^+'|V^^|). 

By Gronwall's inequality (and noting from the a priori smoothness of all quantities 
that j-'^+i^/M vanishes at the origin), we thus have 

and (39) follows. 

Step 4. Proof of (24). 

From the chain rule and the definition of the Y^ wc have the identity 

^^^^^^^ ^ det{K-\KSf'^^MX^„. . . ,K-\K6Y-^^^-)X^) 

det(l^j,... ,y^^) 

Prom (23), (34), (17) we thus have 

det(£>$)(i) ~ K-''\K{xo)\ for all t e B. 

To prove (24) it thus suffices to show that $ is essentially one-to-one on B. More 
precisely, we will be able to cover the large ball B by 0(1) small balls of radius ~ 1 
such that $ is one-to-one on each ball. 



To prove this, suppose for contradiction that there existed t,t' in B with t ^ t', 
\t - f'l < 1 and $(t) = $(t')- Prom (22), (39), (23) and the inverse function 
theorem we see that there exists a non-zero linear combination Y — X^il^i onYwi 
with a, = 0(1) such that t' = exp(y)t. This implies that $(t') = exp(X)$(t), 
where X := J^Zi aiK~\K6Y°^^'"''^ X^,.. However from (33) we see that X is a 
smooth non-vanishing vector field, with a norm of 0{5\ + 82) (for instance). 
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It is easy to see that for such vector fields, exp(X) cannot have any fixed points; 
indeed for such fields we differentiate the identity 

^ exp{tX){x) = X{exp{tX){x)) 

in time to obtain 

^cxp(tX)(x) = 0{{5,+5,)\j^eMtXKx)\) 
and from Gronwall's inequality wc thus have 

^ exp{tX){x) = X{x) + 0(((5i + 52)\t\\X{x)\) 

for \t\ < 1, and the claim follows if 5i + 62 is sufficiently small. Thus we obtain the 
desired contradiction. 



We remark that one can use Proposition 4.1 to show that 

\B{xo;Si,S2)\ ~ |A(a;o)| 

by using (24), (21) and (iterated versions of) (5); we omit the details. In particular 
we can prove a rigorous version of (6) (with I restricted to I). 



5. Necessity of the Newton polytope 



We now have enough machinery to prove the easy direction of Theorem 1.10. More 
precisely, we assume 1 < pi < p'2 < 00 are such that the associated point (ci,C2) 
defined by (7) is outside of the Newton polytope P. 

The (quarter-infinite) polytope P only has a finite number of vertices, all of which 
are of the form deg(/) for some collection of n-tuples / G VF". By choosing s 
sufficiently small, one can assume that all of these n-tuples are in I. 

Since {01,02) is outside of P, we may find a half-plane of the form 

{(xi,X2) ■■ aixi +a2X2 > 1} 

which contains P but docs not contain ci, C2. Since P is quarter- infinite and con- 
tained in the quadrant {{xi,X2) : Xi,X2 > 1} we may take < ai , 02 < 1; by taking 
e sufficiently small we may assume that e < ai, 02 and that aici -|- a2C2 <1 — s. 

We let < ^0 ^ 1 be a small parameter, and set 

6:= {duS2) := {S^\S^^). 
Observe from construction that 

for all {xi,X2) G P- In particular we have 
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We now apply the machinery of the previous section, with a;o := and 6 set as 
above. We set 

where -Bi/k is the ball of radius 1/K centered at 0. From (24) we have 

|O|~C^i|A(0)|. (41) 
On the other hand, from (21) we see that 

e*^^ Bi/K C B 
for all i = 1, 2 and \t\ < 1/K. Thus 

for all j = 1,2 and \t\ <1/K. In particular, from (24) we have 

I y e**^-^^T2|<C^|A(0)|. 

t:\t\<l/K 

Since Xj lies in the kernel of d-Kj , is bounded away from zero, and ttj is a submersion, 
we thus have 

Inserting this bound and (41) into (4) we obtain 

\m\<CK{\m\i6if'^''{\m\ihf''' 

which after some algebra and (7) becomes 

|A(0)| > CKi^^^'^^y 

But this contradicts (40) if we choose 5q sufficiently small. This concludes the proof 
of necessity for (ci, C2) to lie in the Newton polytope P. 



6. Sufficiency of the interior of the polytope: notation and 
preliminary reductions 

We now begin the proof of the more difficult direction of Theorem 1.10. For this 
direction we assume 1 < p\ < p'2 < oo is such that the exponent pair (ci,C2) 
defined by (7) lies in the interior of the polytope P, and our task is to prove (4) 
(the strong- type estimates then following by Marcinkiewicz interpolation). 

Wc may assume that the set V is contained within a I/if- neighborhood of the 
origin. In particular, if a is a Lipschitz function on V such that |a(0)| ~ 1, then 
\a{x)\ ~ 1 for all X G F (if K was chosen sufficiently large). 

Fix pi, P2, ci, C2, and let C F be a fixed set of positive measure. We define the 
quantities aj = for j = 1, 2 by 
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these are the analogues of the Sj in the previous section. Note that < aj <1/K 
(since V has diameter 0{1/K)). By the same algebra used in the previous section, 
we can rewrite (4) as 

M>al^a',\ (43) 

We adopt the periodic notation that aj+2 = ctj for all j. By symmetry we may 
assume that 

ai > 0:2- 



Intuitively, the proof of (43) runs as follows^. From (42), we expect that for "most" 
points Xo in O, one can flow using 6*^^ for some random \t\ < 1, and return to Cl 
with probability ~ ai. Similarly for e*"^^ and 0:2. In particular, if one considers a 
generic expression of the form 

e*"^" ...e*i^ia;o (44) 

then such a point should have a probability at least > a„ . . . ai of lying in ft. If 
this expression as a function of t = {ti, . . . ,tn) had good injectivity properties (in 
that its Jacobian was bounded away from zero), this would then give a significant 
improvement to (43), namely 

|f]|>a„...ai=ai("+^)/^J4n/2j^ 

This however is too good to be true, as the results of Section 5 already indicate. 
The problem is that the Jacobian of the map 

(ti,... ,t„) ^e*"^" ...e*i^ia;o 

can degenerate to zero on a complicated set (although the Hormander condition 
does ensure that this Jacobian does not vanish entirely). 

In the work of Christ[6], the operator of convolution with the curve {t, t^, . . . , t"~^) 
was considered. This curve has special algebraic properties (for instance, the Jaco- 
bian turns out to be a Vandermonde determinant), and one could obtain a large set 
of times (ii, . . . ,t„) where the Jacobian was reasonably large^ by combinatorics. 
However in our more general context the Jacobian has no special algebraic structure 
and the zero set is about as badly behaved as a generic algebraic variety. While 
algebraic varieties do have some structure which can be exploited (for instance, 
their intersections with other algebraic sets have a bounded number of connected 
components), it is not clear how a general set of times ti, . . . ,tn can avoid this zero 
set. 



Our argument follows the iteration ideas of Christ [6] (which in turn were inspired by the 
work of Bourgain, Wolff, and Schlag on KaJseya-type problems) adapted to the general vector field 
setting. Of course, the idea of iterating an operator supported mostly on one-dimensional sets to 
increase the dimension of the support is hardly new (see e.g. [4], [9], [5], [12]); our main innovation 

is the utilization of the concept of width introduced in the previous section. 

^Strictly speaking, the argument in [6] requires an iteration by 2n — 2 steps instead of n in 
order to locate a sub-n-tuplc of times which can avoid the zero set of the Jacobian. This additional 
complication is avoided in our approach by restricting our times to a central set of small width; if 
the times are allowed to range over long distances (much greater than their natural width) then 
one can give counter-examples to the above iteration scheme working if one only uses n iterations 
(Christ, personal communication). 
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To get around the possible algebraic complexity, we go back and do a preliminary 
pruning of the set of times ti, . . . ,tn which we use in the above argument. In 
particular, by using a simple stopping time argument we can restrict each time ti 
to a central set of fixed width (the central nature comes because the set of times is 
essentially a difference set), although it will cost us an epsilon in the exponents to 
do so. Once one has restricted the time sets in this manner then (44) is restricted to 
one of the two-parameter Carnot-Caratheodory balls constructed earlier. By using 
a rescaling adapted to the vector fields (specifically, we use the map $ constructed 
in Section 4) we can assume that these widths are close to 1, at which point one 
can iterate Lemma 7.3 below to give satisfactory lower bounds on the Jacobian. 
(This trick was inspired by the "two ends reduction" of Wolff [49]). The condition 
that (ci,C2) lie in the interior of P then comes naturally from the Jacobian of 
the rescaling transformation (i.e. from (24)), by computations similar to those in 
Section 5. 

To make the above argument rigorous we shall need some additional notation. Let 
B denote the estimate 

and write A w B if A ^ B and B A. HE and F are sets, we say that E is & 
refinement of F, or E ^F, \i E c Fand \E\ » 

To prove (43) it will suffice to show that 

\9.\>al'al^+^'''. (45) 

Indeed, if (45) held, then by making N large and then making e small, we obtain 
(43) with ci,C2 replaced by small perturbations of ci, C2. But since (ci,C2) is an 
arbitrary interior point of P, and ai, a2 < 1, we thus obtain (43) for all (ci, C2) in 
the interior of P as desired. 

It remains to show (45). This will occupy the remaining sections of the paper, after 
a digression on widths. 



7. An interlude on widths 

We now introduce the main innovation of this paper, that of a central one-dimensional 
set with a fixed width. 

Definition 7.1. Let 5 be a subset of [— C, C] with non-zero measure. We say that 
S is central with width w for some w > if 

\x\ < w for all X & S 

and 

\InS\<i\I\/wr\S\ (46) 

for all intervals I. 
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Thus central sets with width w are spread out somewhat evenly within an interval 
[—Cw, Cw] which is centered at the origin. It is essential that this interval is central 
(i.e. centered at the origin), and that there is absolutely no portion of S outside 
this interval, otherwise the iteration argument we use will not give a set contained 
in a small Carnot-Caratheodory ball. 

Note that central sets have diameter comparable to their width. In particular: 

Corollary 7.2. // one central set of width w is a subset of another central set of 
width w' , then w <w' . 

A central set S with width w is supported on an interval of length ~ w and is 
not concentrated on any smaller interval. This non-concentration gives such sets 
good properties when it comes to obtaining lower bounds of integrals of these sets. 
The idea of using such a non-concentration condition was inspired by the work of 
Wolff [49] on the Kakeya problem, in which he utilized a very similar "two-ends" 
condition to achieve a similar effect. 

In a later proposition (Lemma 8.2) we shall construct some central sets with some 
width w. In the remainder of this section, we show how the width property is 
useful. 

Lemma 7.3. Let P{t) he a polynomial of one real variable of degree d = 0(1) and 
||-P||oo ^1 on an interval [—C,C]. Let S be a central set in [— C, C] of some width 
< w -C 1. Suppose that we have the lower hound 

\ilyPiO)\>w"^ (47) 

for some j > and m > 0. Then, if w is sufficiently small, we can find a subset 
S' C S of measure 

\S'\ « \S\ 

such that 

\P{t)\ > w^^'^''^^ for all t e S'. (48) 

Observe how the hypothesis (47) only requires a lower bound at a single point, 
while the conclusion (48) yields a lower bound on a large fraction of the set S. 
Such a bound would not be possible without the non-concentration assumption 
(46) (unless one was willing to lose a power of l^l in (48), which would almost 
certainly lead to much worse exponents'" p\ , P2 for the positive results in Theorem 
1.10). 

Because of the unspecified power of w in (48) we will only be able to usefully apply 
this lemma when the set S has width close to 1. However we will be able to use 
the map $ developed in the previous section to rescale sets of small width to sets 
of ^vidth « 1 to resolve this problem. 

^Admittedly, if one was very careful then one might still be able to obtain near-optimal results, 
see e.g. [6]. However the general theory of lower bounds is far from optimal at this stage; see [3] 
for some recent progress. 
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Proof By considering the homogeneous functional 

|lP||:=max inf iP^'^^t)! 
" " k<d te[-C,C] ' ^ ' 

on the finite dimensional space of polynomials of degree d, we see that (47) implies 

\P^''\t)\>w"' on [-C,C] (49) 

for some k < d. Wc may assume k > 1. Consider the set of real numbers t such 
that P{t) = 0(ii;™+2''). This is contained in the union of at most 0{d) intervals, 
each interval has length at most O(w^) by (49). By (46), each such interval thus 
contains at most 0{w'^) of the set S. If wc then set S' to be the portion of S 
which is outside these intervals, we have \S'\ w \S\ (if w is sufficiently small) and 
|P(i)| Z on S" as desired. ■ 



8. Main argument 



We now give the rigorous proof of (45). Let Iq C I denote those elements of I for 
which A/(0) ^ 0. Since Iq is finite and non-empty, we may assume by continuity 
that Xi{x) ~ 1 for all x gV. In particular we have 

|A(^)I ^ Yl ^'^''^^^^ 

/elo 

for all a; G y and < ^i, ^2 -C 1. 

Choose £ > small enough so that Iq contains all the vertices of P. Since (ci,C2) 
lies in the polytope P and < ai, 0:2 < 1, we see from construction of P that 

/elo 

Thus to prove (45) it will suffice to show 

/elo 



From (42) we expect, heuristically, that for generic points in x G Q and t = 0(1), 
the point e*"^^ {x) has a probability aj of remaining in O. To make this precise 
we will need to pass from O to various refinements of fl, which we now construct. 

We first make a technical reduction to reduce the diameter of f2. By the pigeonhole 
principle (covering V by balls of radius 0(0:2^^)) one can find a subset Cl c ft 
which has diameter 0{a2^^) and is such that 

\n\>a^/''\n\. (52) 

Henceforth we fix this (l. 
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Definition 8.1. Let j be an integer. A j - sheaf U,' of width wj is defined to be a 
refinement CI' of Ct such that, for all x € Cl', the set 

{|iKl:e*^^(a;)en'} (53) 
is a central set of width Wj and measure ^ a^^^ Oij. 



This definition and the following lemma is very reminiscent of the restricted weak- 
type reductions used in, e.g. [6]. Our main innovation here is that the sets of times 
are central sets with fixed width; this will be crucial in placing (a large subset of) 
Q, inside one of the Carnot-Caratheodory balls used earlier. 

Lemma 8.2. For any refinement Q.' of SI and any integer j, there exists a refine- 
ment {Cl')j ofQ' such that {Sl')j is a j -sheaf of some width a^^^ oij ^ Wj < o^^^ ■ 



Proof Fix j, Cl' . It will be convenient to change co-ordinates so that E is locally like 
R", and Xj is just the constant vector field e„. In this case we can reparameterize 
Mj as R"~^, so that tt^ : R" — > R"~^ just becomes the Euclidean projection 

T^jixii • • • 5 Xn) = (iTl, . . . , Xn—1^- 

We can of course let Oe be the usual origin (0, . . . ,0). 

Consider the function {TTj)*x^'j the push-forward of XQ' by ttj. This function has 
norm \ fl'\, and is supported on a set of measure 

<\njm = \n\/a,<a-^/''m/aj. 

Because of this, we can find a set E in R"^^ such that (7rj)*XO' ~ a^^^aj on E 
and 



At this point one could attempt to set {Cl')j equal to fi' n tt~ (E), but we would 
not get the crucial property that (53) is central with a fixed width. To obtain this 
additional property we must refine further. 

Let X be any point in E C R"~^, and consider the set 

S{x) := {t : {x,t) e n'}. 

Prom the diameter restriction on Qq we see that this is a subset of [—Ca2^^,Ca2^^] 
with measure ^ c^^^ a^. 

Let I{x) be the dyadic interval of minimal length such that 

\I{x)r^S{x)\>\\I{x)nS{x)\■ 

if there are many intervals of this length, we choose l{x) arbitrarily. Let w{x) 
denote the length of l{x). 
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This interval is well-defined and must have length a^^'^aj ^ w{x) < (£1^ ■ Set 
S'{x) := I{x) n S{x). By construction we see that 

\S\x)\^\S{x)\>oFJ''<x^ (54) 

and that 

\I^S'{x)\<{\I\lw(x)r\S'{x)\ (55) 

for all intervals I. (Note that any interval / can be covered by 0(1) dyadic intervals 
of comparable length, and each of the dyadic intervals are either contained in /(a;) 
or are disjoint from it.). 



Since w{x) can only take 0(log(l/a2)) ~ 1 many values, there exists an a^^^ aj ^ 

S'{x)\ « / \S'{x% 
Je 

[ \S'{x)\ « / \S{x)\ = [ {nj).xn' ^ m « |f2|. 

JxeE:w(x)=wj Je Je 

We then choose such a wj and define 

{n')j ■= {{x,t) : x e E,w{x) = Wj,t G S'{x)}. 



Wj < a2^^ such that 



which by the previous implies that 



We now verify that (il')j is a j-sheaf of width Wj. Choose (xo,to) •= By 
construction, Iq lies in S'{xq), which is itself contained in an interval I{xq) of 
length Wj. Since the set of times (53) corresponding to (a;o,to) is just S'{xo) — to, 
we see from (54), (55) that (53) is indeed a central set with width Wj and measure 

^ a2^^ dj. (The all- important centrality condition follows since I{xo) has length 
Wj, so that S'{xq) — to is contained in the interval [—Wj,Wj].) ■ 



The above Lemma allows one to refine a set fl so that the tti fibers (for instance) 
have good properties. It is tempting to iterate this lemma so that the tt2 fibers 
are also good, but unfortunately the latter refinement can destroy some of the 
properties of the former. One could then try to iterate the lemma repeatedly, but 
the widths wi , W2 may change in doing so. Fortunately we can salvage matters by 
using a pigeonholing argument (dating back at least to [51]; we use the formulation 
in [35]. Somewhat similar ideas appear in [6]), if we are willing to lose a power of 

l/JV 

"2 : 

CoroUciry 8.3. There exists dyadic numbers 0.2^^ aj ^ p.j < 02^^ for j = 1,2 
and an increasing sequence 

Oo ^ Oi . . . ^ ji„ ^ 

such that for all < j < n, the set Qj is a j -sheaf whose width Wj satisfies 

cc2^^ tij ^ Wj < /Xj. Here we have adopted the convention that the subscripts of jii, 
/U2 are extended periodically. 
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Proof We first define a much longer decreasing sequence of refinements 

by setting n'-J^ = for all 1 < j < N^. (Note we are extending the 

construction in Lemma 8.2 periodically in j in the usual fashion). By construction 
we see that each fl^^") is an j-sheaf of width w'^^^ for some a^^^a^ ^ w^-^^ < 
Q!2^^. (Note that we are allowed to apply the lemma N"^ times because our implicit 
constants are allowed to depend on N). 

Since C we see from Corollary 7.2 that ^(^+2) < w^) for all 1 < j < 

N"^ — 2. Thus if we set A'' to be large enough, then by the pigeonhole principle one 
can find an even number n < Jq < N"^ and numbers /ii, /i2 (extended periodically) 

such that a^^^aj ^ /ij < o'^^^ and a^^'^ ^ w'-'-' < /ij for all jo — n < j < jo. 
The claim then follows by setting = n^-'""-') and wj = w^^°~^^ (note that jo — j 
and j always have the same parity). ■ 



Let /ij, ilj be as above. Since Uq is a refinement of il, it is non-empty. We fix a 
point xo e ^0- Define 61,62 by 

r -1/N 

and extend this periodically in the usual manner, 6j+2 = 6j. Then we have 
Oi^^'^ 6j ^ Wj < oty^ 6j for all j = 1, . . . , n, 
< ail5i < Q2 

and 

In particular we have 



For all < z < n define the map $i from a ball of radius 0(1) and center in R* 
to S by 

$i(ti,... ,ii) := e*^**^*...e*^*^^Ha;o), 
and let Tj c R' denote the set 

Ti := {t€lC : $j(ti, ... ,tj) G ^-i for all 1 < j < i}, 

with the convention that To = R° {()}, the singleton set consisting of the unique 
0-tuple. Observe that for each < i < n and t € Ti the set 

n{t) ■.= {u+i eR: {t,u+i) eT,+i} 

is a central set of measure ^ ai/6i and width Wi for some a^^^ « ^ 0^2^^ i 
indeed, the sets Ti{t) are just the sets (53) rescaled by 6^. 

Since the U are bounded (and this is the key place where we use the centrality of 
the set (53)), we see from (21) that 

^jiTj) c $(B) for all 1 < i < n 
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where $, B are as in Section 4; here we assume the radius of B to be sufficiently 
large. 

Now by construction, the set $„(T„) lies in il. By (24) we thus have 
However, we see that 

$-1 o = 

where maps a ball in R-' to B and is defined by 

and the Yi are the vector fields from Section 4. Thus we have 

|f]|>|A(xo)||*„(r„)|. 

In the next section we shall show the bound 

l*n(T„)|>ar^...?^. (56) 

Inserting this into the previous and using the fact that aj and 6j are periodic, we 
can write this as 

\n\ > a2^/^|A(xo)|(^)L("+i)/2J(^)L»/2J. 

Now for all I £ Iq, we observe that deg(/)i, deg(/)2 > n — 1. This is because all 
the vector fields Xyj in I must be distinct (otherwise A/ = 0), and apart from the 
vector fields Xi, X2, every other vector field Xyj has deg(w)i, deg(w)2 > 1- Since^ 

n-1 > [(n+l)/2j,[n/2j 

and aj/6j ^ 1, we thus have 

Ol O2 

for all / G Iq- Combining this with (50) we obtain (51) as desired. 

It remains to show (56). This will occupy the entirety of the next section. 

9. Conclusion of the argument 

We now prove (56). To simplify the notation we write f3j := cxj/Sj. Our situation 
is as follows. We have a point € V which is within 0(1/ K) of the origin 0. We 
have sets Tj € for ?' = 0, . . . , n, which have the structure 

T^+i = {{t:U+i) e R'+i : t G T,;U+i e nit)} 

for i = 0, . . . ,n — 1, with To = R" = {()} and for each t G Ti, the set Ti{t) is a 
central set of width 

^Notice the "slack" in the argument here. What this is saying is that when one has a two-ends 
condition on the set of times (53), then the lower bounds on \ improve substantially. This is 
consistent with the experience with the Kakeya problem in e.g. [49]. 
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and Ti{t) has measure 

kiWI^a^^'^A- (57) 
Prom our upper bound on Wi we see that 

TiC[-r,rY 

for all < i < n and some r ~ a^^^ , which we fix. 
From (57) and Fubini's theorem we have 

|r,+i|>a^/^A|r,| 

and hence 

|T„|>a^/^/3i.../J„. 
On the other hand, we wish to prove that 

l*n(T„)|>a^/^/3i.../3„. (58) 
It will therefore suffice to find a subset of T„ with 

K\>a^^''p,...f3n (59) 
such that we have the Jacobian bound 

det(L»*„)(OI ^ a^^^ for aU t £ T^. (60) 

Indeed, if we can find such a T^, then we cover by balls of radius a^^^^ for some 
large constant Ci . If Ci is large enough, then '^n is one-to-one on every one of the 
balls that intersects T^^, thanks to (60), the smoothness of (recall from Section 4 
that the arc smooth uniformly in xq, Si, 62), and the inverse fmiction theorem. 
By the pigeonhole principle one of these balls must contain at least a2^^^^ of the 
set T^, and the claim (58) then follows from (60), (59) and the change-of- variables 
formula. 



It remains to construct a set (59). This will be achieved by the width property 
of the Ti{t) and several applications of Lemma 7.3, but first wc need to control 
some derivative of det(£'\l/„(<)). We shall do so using the Hormander condition 
assumption, by invoking the following quantitative analogue of Frobenius's theorem 
(which can also be derived from the arguments in [8]): 

Lemma 9.1. There exists a multi-index (3 = (/3i, . . . ,/3„) with \(3\ < C{Io,n) such 
that 

\d^det{D^n)U=o\>C],\ 



Proof We shall prove a more general statement. For any 1 < i < n, let Jj denote 

the i-form 

Ji{t) :=dt,^i{t)A...Adu^i{t). 

defined for all t in a ball in R\ Wc shall show that for each i there exists a multi- 
index /3* with 1/3*1 < C(/o, n) and a small number < <C 1 depending on K such 
that 

\dfji\t=o\>Oi. (61) 
The Lemma will then follow by applying (61) with i = n. 
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The idea of the proof can be sketched as foUows. InformaUy, the statement (61) 
asserts that the vector fields Xi,. . . ,Xi are non-degenerate, and so the iterated 
map fills out a large subset of a j-dimensional manifold. If Xj+i is transverse 
to this manifold then one can induct and obtain (61) for i + 1. Similarly if Xi+i is 
tangent to the manifold to some (bounded) finite order. The only remaining case is 
if Xj+i is tangent to the manifold to some extremely high order. But this will imply 
that the iterated Lie brackets of Xj+i and X^ (for instance) are also tangent to the 
manifold, and this will eventually contradict the Hormander condition assumption. 

We now return to the rigorous proof of (61). For i = 1 this is clear just by taking 
,9 = and using the non- vanishing of Yi. Now suppose that i > 1, and the claim 
has already been proven for i — 1. 

Let < 0i <C 1 be a small number depending on K, Oi-i to be chosen later, let 
A — A[Io,n) be a large integer to be chosen later, and let e = e(^) be a small 
number to be chosen later. 

By induction hypothesis and the smoothness of Jj-i, we can find a ball -Bj_i C R*~^ 
of radius > C~l,_^ and at a distance 0(e6'i_i) from the origin such that 



(cf. the arguments in Lemma 7.3); the constant C can depend on /3i_i. By shrinking 
if necessary we may then assume that is one-to-one on The set 

^'i_i(i3i_i) is thus an open subset of a smooth i — 1-dimensional manifold in B. 
Observe that the i — 1-form Ji-\{t) is tangent to ^'j_i(i?i_i) at ^i-i{t) for every 
t G 

Suppose for contradiction that we have 



\dUi\t=o\<Oi 

for all multi-indices /? with |/3| < A. Then by Taylor expansion we have 

\Ji{tu . . . ,ti-iM < CA9i + CAc^et, 

whenever ti,. . . , = 0{e6i-i), and more generally that 

\dfji{h, . . . ,ti-i, 0)1 < CAOi + C^(e^,_i)^-I^l 

whenever |/3| < A. On the other hand, by definition we have 

Ji{ti, ■ ■ ■ ,ti_i,0) = Ji-i{ti, . . . ,ti-\) A dti^i{t\, . . . ,ti-\,ti)\ti=Q 
= Ji-i{ti,... ,ti-i) AYi{^i_i{ti,... ,ti-i)). 

Because of this and (62), we see that for every t G -Bj-i that we can write 

y,(*,_i(i)) = y*«"9^"*(*,_i(i)) + o{CA.g^_,.,e, + CA{e9,^i)^-^) 

where y*""9'="*(^'i_i(f)) is tangent to ^'i_i(B,_i) at *i-i(t). Since ^i_i{Bi_i) is 
a smooth manifold, we may thus write 



\Ji-i{t)\ > e^9f_^ for all t e B, 



'i-l 



(62) 




+ Y' 



error 
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on a Cg/';^ -neighborhood of \E'i_i(i?i_i), where y*<*"S'="* is a smooth vector field 
which when restricted to is tangent to and F^'"'""'' is an- 

other smooth vector field such that 

for all multi-indices /? = . . . ,/3„) with |/3| < A. 
Prom the identity 

Fi_i(*i_i(t))=at,_,*i_i(t) 

we see that Y^-i is also tangent to when restricted to 

Thus any Lie bracket combination of and y*«"s«"* must also be tangent to 
when restricted to 

Now let w be any word with deg(w)i + dcg(w7)2 ^ A. The vector field YJ^, is 
^i-deg(«,)i-deg(«;)2 ti^es a Lie bracket combination of Yi and ^2- One of these 
vector fields is Fj-i, the other is F,, which can be split into y*o"sent ^^^^ yerror 
From the above discussion we thus see that when restricted to is 
equal to a vector field tangent to plus an error term of magnitude at 

most 

CK,A,0i-i,e6i + CK,A^Oi-i. 

Applying this with Y^^ ,.. . Y^^ and taking wedge products we obtain 

If we pick e sufficiently small depending on A and K and 6i sufficiently small 
depending on A,K, 9i-i, e we see that this bound contradicts (23), and we are 
done. ■ 



We can now quickly construct a set with the properties (59), (60), which will 
finish the proof of (56) and thus of Theorem 1.10. 

By a Taylor expansion we may write 

det(D'fnm=P{t) + 0{a^) 

on t G [— r, r]", where P is a polynomial of degree 0{N^) = 0(1), ||-P||oo ^ 1 and 
|9"P(0)| > (the index a = (ai,... ,a„) with bounded coefficients is as in 
Lemma 9.1). It therefore suffices to construct T„ satisfying (59) so that (60) holds 
with P{t) in place of det(D*n)(f). 

We apply Lemma 7.3 with S equal to ti(()). This gives us a subset r{(()) of ri() 
with measure 

k((())|^a^/^/3i 

such that 

|af/...a«"P(ti,0,...,0)|>a^/^ 

for alHi eT((()). 
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Fix ti € '''{(()). We now apply Lemma 7.3 with S equal to T2(ii). This gives us a 
subset T2{ti) of T2{ti) with measure 

such that 

\dg^...dl-P{tut2,0,.-. ,0)1 ^a^/"^ 

for all t2 e 7-2(^1). 

We contimic in this manner, defining sets T^{ti_i) recursively. Note that since our 
constants C are allowed to depend on n that we will not have trouble iterating the 
above scheme n times. If we recursively define Tq = {()} and 

7^+1 := e R'+i : t & T!;U+^ G r/(t)} 

we thus sec that obeys (59) and (60) as desired. This completes the proof of 
Theorem 1.10. 

10. Remarks 

• The estimate (51) can be rewritten using (6), and the fact that N was 
arbitrary, in the rather appealing form 

\n\>\B{0;ai,a2)\. (63) 

In other words, if one fixes the thicknesses aj = \Q\/\TTj{Q)\ of a set 
in a small neighbourhood of 0, then the volume of the set fl is essentially 
minimized when f7 is a two-parameter ball centered at the origin. One 
may thus think of our main result as a kind of two-parameter isoperimetric 
inequality in the spirit of [26]. 

• It would be nice if the ^ in (63) could be replaced by a >, as this would 
give restricted-type boundedness on the entire boundary of the Newton 
polytope P (bringing these results in line with those in, e.g. [6]). If one 
wishes to adapt the above argument to do this, there seem to be two major 
obstacles. The first is that the argument requires more and more regular- 
ity on (S,Mi, M2,7ri,7r2,0E) as one approaches the boundary of P. This 
particular difficulty can be avoided by restricting one's attention to model 
cases, such as when S is a nilpotent Lie group and Xi, X2 are left-invariant 
vector fields. The other major difficulty is that one cannot refine the set 
of times (53) to a central set of a fixed width without losing an e in the 
exponents. We do not know how to get around this loss. 

• One can be more ambitious still, and try to obtain strong-type boundedness 
at the endpoints of P. This is already difficult to do (at least if one only uses 
geometric combinatorics as in this paper) in such model cases as convolution 
with a compactly supported measure on the parabola {{t, f^)} in R^. In this 
model case the endpoint mapping is L"^/^ , but a naive combinatorial 
argument only gives Li^/'^'^ L^'°° (however see [30]; also see [6] for more 
discussion on this). Of course, one can obtain the strong- type endpoint in 
this case if one is willing to use such tools as complex interpolation and 
smoothing estimates (see e.g. [25]). It may also be possible to do so by 
pure geometric combinatorics, but one probably has to control the extent to 
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which various two-parameter balls of varying radii can overlap each other. 
In particular, a two-parameter covering lemma of some sort may be needed. 
It seems likely that one could extend these results (with the aid of the results 
in [8] or [12]) to obtain good {LP^,LP^) mapping properties for fractional 
integral operators such as 

for < a < 1, where for each x, the map t j(x,t) is a smooth param- 
eterization of the curve jx of the type discussed in the introduction, and 
the Hormander condition obeyed. In fact, if 7(2;, 0) = a: and a = 1 — ^ 
with k odd, then one easily sees (from a simple change of variables) that Ta 
falls within the class of operators described by (2) where the corresponding 
vector fields Xi and X2 are now linearly dependent at the origin. 
The methods here shoiild probably be able to give an alternate way to 
obtain the boundedness of maximal operators of the form 

Mf{x) :=sup2'= / /(7(a;,i))a(2H) dt; 

k>0 J 

in particular, it is now quite possible that one could prove boundedness 
of such operators using purely geometric methods (avoiding the Fourier 
transform, as is used in e.g. [8]). This is of course not the only type 
of maximal function one could study; more generally one could consider 
operators of the form 

M/(x) :=sup j fi^ix,t,r))ait) dt 

for some parameter r; model examples here include the circular maximal 
function and the Kakeya maximal function (see [52] for a survey of these 
operators). It seems difficult however to adapt our techniques to these 
operators except in very low dimensions; one would have to linearize the 
parameter r as r{x), and this begins to destroy the differentiability prop- 
erties of the iterated flow map when n > 3. On the other hand, Christ 
and Erdogan have recently used geometric combinatorics ideas of a similar 
flavor to those in [6] and in this paper to obtain sharp mixed-norm esti- 
mates for certain classes of x-ray transforms; see [7] . However, the general 
question of understanding maximal operators of this type is still far beyond 
our current technology. 

It is also tempting to try to use these techniques to prove smoothing 
estimates (i.e. mapping to LJ^ rather than to i'). However this problem 
is much more difficult due to the presence of cancellation, and does not have 
a nice geometric interpretation such as (63). Indeed this problem is quite 
difficult even for model cases such as convolution with the curve (Lt^,t^), 
as it is related to the local smoothing conjecture (see [34]). On the other 
hand, Wolff [53] has recently combined geometric combinatorics techniques 
with Fourier methods to obtain some progress on these types of problems. 
However, with respect to curves in the plane, Seeger [47], [46] has obtained 
sharp (up to endpoints) to estimates. 
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• Another possible generalization would be to higher-dimensional averages, 
or to asymmetric averages in which the manifolds Mi and M2 have dif- 
ferent dimensions. In our language, this would mean that tti and 1^2 now 
have corank ki and k2 which are possibly greater than 1, and we would 
replace the single vector field Xj by a family of commuting vector fields 

1 k ■ . . . . 

Xj .... ; Xj\ While it is possible to use this machinery (perhaps combined 
with the techniques in [8]) to get some non-trivial {U',L'') mapping result, 
it seems difficult to obtain sharp results, because there seems to be no sat- 
isfactory analogue of the notion of width in more than one dimension. See 
Seeger [47] where nontrivial (L^, L') as well as (L^, L^) results are obtained 
when S is a hypersurface in Mi x M2 (i.e., dimS = ki+k2 + 1). 

• It is crucial in our arguments that the sets Ti{t) of times are central, so that 
the times ti are always close to 0. This allows us to restrict the set (pniTn) to 
lie inside a small two-parameter Carnot-Caratheodory ball, whose geometry 
can be well controlled by the machinery of Section 4. If we allowed Ti{t) 
to wander far away from the origin, then it would in fact be impossible to 
obtain the estimate 

|$„(T„)|>afar (64) 

for the desired values of ci, C2, even in the model case when the Ti(t) are all 
intervals of length ai and we are considering convolution with {t, . . . , t"~^) 
for some n > 5 (see [6] for some further discussion of this issue; this was also 
independently observed by Grcenblatt). The point is that we lose control 
of the geometry if one flows too far along one or more vector fields. On the 
other hand, failure of the estimate (64) docs not imply that failure of the 
lower bound on \ because ^n{Tn) may only occupy a small portion of fl. 
Our particular selection method for T„ (using the machinery of j'-sheaves) 
is thus essential to ensure that $„(T„) does not degenerate to only a small 
fraction of CI. 
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